Autonomic Solutions for Scalable, Reliable Computing
نویسنده
چکیده
The trend toward larger high-performance computing (HPC) systems is leading to an increased demand for scalable software. However, HPC systems are also becoming more complex making the development and use of scalable software increasingly difficult. As a result, very few applications run effectively (or at all) at extreme scales. Furthermore, many application problems are manifested only at large scales, and tools that can diagnose effectively functional and performance complications in these applications are also lacking. With the availability of hundred thousand processor systems and the advent of million processor systems, the lack of scalable software systems is increasingly problematic. This problem is particularly concerning for capability computing where the most powerful computational resources are used to solve large, demanding problems.
منابع مشابه
A Scalable Self-Managing Architecture for WSRF Services
Service-Oriented Architectures provide integration of and interoptablity for independent and loosely coupled services. Web services and the WSRF standards are frequently used to realise such Service-Oriented Architectures. In such systems, autonomic principles of self-configuration, self-optimisation, self-healing and self-adapting are desirable to ease management and improve robustness. In thi...
متن کاملReliability, Diagnosis – Challenges to Pervasive Computing
Pervasive computing systems are highly complex due not only to vast heterogeneity but also to mobile, ad hoc interactions. As a result, it is very costly to provide post-purchase customer support for home pervasive computing. We have investigated current approaches that might provide technologies for solutions: faulttolerant/reliable computing, self-healing/autonomic computing, and tools for te...
متن کاملReliable Multicast Protocol Specialization for Caching and Collaboration within the World-Wide Web
The World Wide Web (WWW) has become an important medium for information dissemination. One model for synchronous information dissemination is a scheme called webcasting where data is simultaneously distributed to multiple destinations. The WWW's traditional unicast client/server communication model su ers, however, when applied to webcasting; solutions which require many clients to simultaneous...
متن کاملGrassroots Approach to Self-management in Large-Scale Distributed Systems
Traditionally, autonomic computing is envisioned as replacing the human factor in the deployment, administration and maintenance of computer systems that are ever more complex. Partly to ensure a smooth transition, the design philosophy of autonomic computing systems remains essentially the same as traditional ones, only autonomic components are added to implement functions such as monitoring, ...
متن کاملTowards Autonomic Hosting of Multi-tier Internet Applications
Large scale e-commerce enterprises like Yahoo and Amazon use complex software systems made of hundreds of Internet services to serve content to millions of clients. These services are multi-tiered Web applications that perform certain business logic and are exposed through well-defined client interfaces usually accessible over the network. A constant challenge faced by these organizations is to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007